Debugging Scalable Applications on the XT
نویسنده
چکیده
Debugging at large scale on the Cray XT can involve a combination of interactive and non-interactive debugging; the paper will review subset attach and provide some recommendations for interactive debugging at large scale, and will introduce the TVScript feature of TotalView which provides for noninteractive debugging. Because many users of Cray XT systems are not physically co-located with the HPC centers on which they develop and run their applications, the paper will also cover the new TotalView Remote Display Client, which allows remote scientists and computer scientists to easily create a connection over which they can use TotalView interactively. The paper will conclude with a brief update on two topics presented at the previous two CUG meetings: memory debugging on the Cray XT and Record and Replay Debugging.
منابع مشابه
Scalable Tool Infrastructure for the Cray XT Using Tree-Based Overlay Networks
Performance, debugging, and administration tools are critical for the effective use of parallel computing platforms, but traditional tools have failed to overcome several problems that limit their scalability, such as communication between a large number of tool processes and the management and processing of the volume of data generated on a large number of compute nodes. A tree-based overlay n...
متن کاملSDB : A Novel Simulation-Based Debugger for Sensor Network Applications
Sensor network computing can be characterized as resourceconstrained distributed computing using unreliable, low bandwidth communication. This combination of characteristics poses significant software development and maintenance challenges. Effective and efficient debugging tools for sensor network are thus critical. Extant development tools, such as TOSSIM, EmStar, ATEMU and Avrora, provide us...
متن کاملDb : a Novel Simulation-based Debugg Er for S Ensor Network Applications *
S e nsor net wor k c omput i ng can be char act eri zed as resource-const r ai ned distributed computing using unreliable, low bandwidth communication. This combination of characteristics poses significant software development and maintenance challenges. Effective and efficient debugging tools for sensor network are thus critical. Existent development tools, such as TOSSIM, EmStar, ATEMU and Av...
متن کاملScalable performance analysis of large-scale parallel applications on Cray XT systems with Scalasca
The open-source Scalasca toolset (available from www.scalasca.org) supports integrated runtime summarization and automated trace analysis on a diverse range of HPC computer systems. An HPC-Europa2 visit to EPCC in 2009 resulted in significantly enhanced support for Cray XT systems, particularly the auxilliary programming environments and hybrid OpenMP/MPI. Combined with its previously demonstra...
متن کاملRuntime Checking of Multithreaded Applications with Visual Threads
Multithreaded applications are notoriously difficult to design and build while avoiding defects. Many of Compaq’s customers need to employ threads to implement high-performance, scalable applications that address their needs in business and science. In order to ensure their success using threads, Compaq provides a runtime debugging and analysis tool for multithreaded applications called Visual ...
متن کامل